Theory and practice of ambiguity labelling with a view to interactive disambiguation in text and speech MT

نویسندگان

  • Christian Boitet
  • Mutsuko Tomokiyo
چکیده

In many contexts, automatic analyzers cannot fully disambiguate a sentence or an utterance reliably, but can produce ambiguous results containing the correct interpretation. It is useful to study vatious properties of these ambiguities in the view of subsequent total or partial interactive disambiguation. We have proposed a technique for labelling ambiguities in texts and in dialogue transcriptions, and experimented it on multilingual data. It has been first necessary to define formally the very notion of ambiguity relative to a representation system, as well as associated concepts such as ambiguity kernel, ambiguity scope, ambiguity occurrence.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Interactive Real-time Translation via the Internet

The Internet offers a tremendous opportunity for experiments with real-time machine translation (MT) dialogues. Interchanges may be typed, or speech recognition (SR) techniques might be used to accept spoken input. In either case, given the number of ambiguities still escaping automatic resolution, interactive disambiguation for both components seems likely to remain important in many Internet ...

متن کامل

A Context-Sensitive Homograph Disambiguation in Thai Text-to-Speech Synthesis

Homograph ambiguity is an original issue in Text-to-Speech (TTS). To disambiguate homograph, several efficient approaches have been proposed such as part-of-speech (POS) n-gram, Bayesian classifier, decision tree, and Bayesian-hybrid approaches. These methods need words or/and POS tags surrounding the question homographs in disambiguation. Some languages such as Thai, Chinese, and Japanese have...

متن کامل

An Interactive Disambiguation Module for English Natural Language Utterances

An interactive disambiguation methodology has been proposed and implemented at the GETA lab. in the framework of Dialogue-Based Machine Translation. This methodology has been generalized and re-engineered at ATR-ITL in the framework of spoken language transtlation and the MIDDIM project, a joint research between ATR-ITL and the GETA-CNRS aimed to study Multimodal Interactive Disambiguation. The...

متن کامل

Part of Speechtagger for Kannada

Parts of speech tagging is a well-understood problem in NLP. The importance of the problem focuses from the fact that the Parts of Speech tagging is one of the first stages in the process performed by various natural language related process. POS tagging is the process of assigning the part of speech tag or other lexical class marker to each and every word in a sentence. POS tagging has a cruci...

متن کامل

Comparison of Point of View based on Gérard Genette’s Focalization theory in Sophocles’ Ajax and Faulkner’s The Sound and the Fury

Sophocles’ Ajax is among the most significant plays in ancient Greek literature, and William Faulkner’s The Sound and the Fury is an important modern novel. This study considers these two works using Gérard Genette’s Focalization term in Narratology theory. Focalization is a feature of the point of view in narrative and is described by characters’ action, emotional reaction and speech. This stu...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1996